Syntactic Normalization Of Spontaneous Speech
Abstract
This paper presents techniques that equip a standard parsing system for the analysis of ill-formed utterances. These techniques are feature generalization and heuristically driven deletions.

PROBLEM

Generally, the development of grammars, formalisms and natural language processors is based on written language data or, sometimes, not on real data at all but on invented 'example sentences'. This holds for both computational and general linguistics. Thus many parsing systems that work quite well for sentences like 1a. and 1b. fail when they are applied to the authentic data in 2a. and 2b.:

1a. die Grundform ist nicht eckig
    the basic form is not angular
1b. das blaue habe ich als Waage auf dem grünen liegen
    I have got the blue one lying upon the-DAT green-DAT one-DAT like a balance
2a. die die Grund die Grundform sind is nich is nich eckig
    the the basic the basic form are is not is not angular
2b. das blaue hab ich als Waage auf das grüne liegen
    I have got the blue one lying upon the-ACC green-ACC one-ACC like a balance

To native recipients the utterances in 2. appear to be more or less defective, but interpretable, expressions. Moreover, the interpretation of 2a. or 2b. might require even less effort than, for instance, understanding an absolutely grammatical 'garden path sentence'. Since utterances like 2a. and 2b. occur quite frequently in spontaneous speech, an approach to parsing everyday language has to provide techniques that cover repairs, ungrammatical repetitions (2a.), case-assignment violations (2b.), agreement errors and other phenomena that have been summarized under the label 'ill-formed' in earlier research (Kwasny/Sondheimer [...]).

I am indebted to Dafydd Gibbon, Hans Karlgren and Hannes Rieser for their comments on earlier drafts of this paper.
Though the present paper will adhere to this terminology, it should be emphasized that it is not presupposed that there are any general criteria precise enough to tell us exactly whether some utterance is 'ill-formed' relative to a natural language. Let us assume, instead, that some utterance U is 'ill-formed (defective, irregular, ...) with respect to a grammar G' iff U is not a sentence of the language specified by G. Since, for instance, repairs exhibit a high degree of structural regularity (cf. Schegloff et al. 1977, Levelt 1983, Kindt/Laubenstein in preparation), one might prefer to describe them within the grammar and not within some other domain (e.g. within a production/perception model). Therefore the concept 'ill-formed' is …
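The grammar-relative definition of 'ill-formed' given above can be illustrated with a minimal sketch. The toy grammar below is a hypothetical assumption made only for illustration (it is not the grammar used in the paper and covers nothing beyond example 1a.), and the function names `is_sentence` and `is_ill_formed` are likewise invented here. The point is only that ill-formedness is decided by membership in the language specified by G, not by any grammar-independent criterion.

```python
from functools import lru_cache

# Hypothetical toy grammar G (an assumption, not taken from the paper):
# just enough context-free rules to derive example 1a.
GRAMMAR = {
    "S": [("NP", "VP")],
    "NP": [("Det", "N")],
    "VP": [("V", "Neg", "Adj")],
    "Det": [("die",)],
    "N": [("Grundform",)],
    "V": [("ist",)],
    "Neg": [("nicht",)],
    "Adj": [("eckig",)],
}


def is_sentence(utterance, grammar=GRAMMAR, start="S"):
    """Return True iff the utterance is in the language specified by the grammar."""
    tokens = tuple(utterance.split())

    @lru_cache(maxsize=None)
    def ends(symbol, i):
        # All positions j such that `symbol` derives tokens[i:j].
        if symbol not in grammar:  # terminal symbol: must match the next token
            if i < len(tokens) and tokens[i] == symbol:
                return frozenset({i + 1})
            return frozenset()
        result = set()
        for rhs in grammar[symbol]:
            positions = {i}
            for part in rhs:  # thread end positions through the right-hand side
                positions = {j for p in positions for j in ends(part, p)}
            result |= positions
        return frozenset(result)

    return len(tokens) in ends(start, 0)


def is_ill_formed(utterance, grammar=GRAMMAR):
    """U is ill-formed with respect to G iff U is not a sentence of L(G)."""
    return not is_sentence(utterance, grammar)


print(is_ill_formed("die Grundform ist nicht eckig"))  # example 1a.
print(is_ill_formed("die die Grund die Grundform sind is nich is nich eckig"))  # example 2a.
```

Under this toy grammar, 1a. counts as a sentence and 2a. as ill-formed. Extending the grammar with rules for repairs would change that verdict for 2a., which is exactly why the definition is stated relative to G.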
Similar resources
A comparison of morphosyntactic features in the speech of agrammatic non-fluent aphasic patients and healthy Persian speakers
Background and purpose: The main features of non-fluent aphasia are inadequate production, limited vocabulary and agrammatism. Such patients have deficits in sentence comprehension and production, and their speech is short and telegraphic. In this study, morphological and syntactic errors in the speech of patients with non-fluent aphasia were compared with those in healthy subjects. Materials and methods: A ...
Speech timing patterning as an indicator of discourse and syntactic boundaries
Although the perceptual reality of speech rhythm has not been unambiguously observed in acoustic correlates, speech has always been considered as rhythmic one way or another. This study looks at speech rhythm in spontaneous speech and its relationship to discourse and syntactic units. Two four-frame comic strips were used to elicit speech, and syllable onsets were measured from waveform/spectro...
The effect of syntactic complexity of noun phrase and verb phrase structure on the occurrence of stuttering in 4- to 6-year-old Persian-speaking preschool children who stutter
Objective: The purpose of the present research was to investigate the effect of the syntactic complexity of noun phrases and verb phrases on the occurrence of stuttering in 4- to 6-year-old Persian-speaking children who stutter. Materials & Methods: This descriptive-analytic research was done on 15 children who stutter, consisting of 12 boys and 3 girls, 4 to 6 years old, monolingual Persian speaking wh...
Prosodic and syntactic structures in spontaneous English speech
In this paper we examine prosodic and syntactic structures of spontaneous English speech. By wavelet-based analysis, the prosodic structure of speech can be visually represented as a tree diagram. Combined with automatic syntactic parsing, this enables a novel method to compare prosodic and syntactic hierarchical structures in spoken language. In our research we segmented a sample of spontaneou...
The Relationship between Syntactic and Lexical Complexity in Speech Monologues of EFL Learners
This study aims to explore the relationship between syntactic and lexical complexity and also the relationship between different aspects of lexical complexity. To this end, speech monologues of 35 Iranian high-intermediate learners of English on three different tasks (i.e. argumentation, description, and narration) were analyzed for correlations between one measure of sy...
Prosodic and syntactic segmentation of spontaneous speech: A preliminary study
In this paper we examine prosodic and syntactic segmentation of spoken Finnish language. Syntactic sentence or clause is generally mentioned as one of the basic units of language, but it can be questioned whether it is a good unit for analysing the structure of spontaneous speech. By wavelet-based analysis, the prosodic structure of speech can be represented as a tree diagram, making it possibl...
Publication date: 1990